About the Course


The Data Scientist Masters program is designed to
introduce you to the world of analytics and build
the skills you need to become a Data Scientist. As
a Data Scientist, you must be able to work with
multiple data formats and know the algorithms that
help you extract useful information from data. If
you are pursuing a career in analytics, this is the
program for you.

The Data Scientist Masters program
trains you along an
industry-recommended learning
path to succeed in the field
of analytics.



Key Features 

• Industry-recommended learning path
• Access to 150+ hours of instructor-led training
• Over 200 hours of high-quality e-learning
• Hands-on project execution on CloudLabs
• An industry-recognized Simplilearn Masters Certificate on completion
• 10+ industry projects
• Technical, project, and programming support by industry experts
• Course Advisor: Ronald van Loon
Business Analytics with Excel
Analytics using Excel

Optional Electives

Apache Kafka
Real-Time Processing with Hadoop

• Certified SAS Base Programmer
• Introduction to Big Data and Hadoop
• Python Basics
• Java Essentials for Hadoop Training


DATA SCIENTIST 
STEP 1

Start your Analytics journey 


SAS 

As a first step in the field of analytics, it is
important to understand statistical concepts and
learn the latest technology for applying them.
This is why SAS is the first step in the Data
Scientist Masters program. SAS is one of the most
popular data science and analytics platforms, so
completing this course prepares you to work on
analytics projects in some of the biggest
companies in the world.



Key Learning Objectives 

After completing this course, you will be able to: 

• Outline data science principles and how SAS can help implement them.
• Explain the different methods used to combine and modify datasets.
• Explain what PROC SQL is and how it’s used to retrieve data from tables.
• Describe how to use macro functions to manipulate character strings and text.
• List the various statistical procedures and explore the various testing techniques.
• Understand how SAS handles missing values in your datasets using various procedures.
• Explain how to create clusters and perform cluster analysis on a dataset.
• List the various time series models in SAS.
Course Curriculum

• Analytics Overview
• SAS Introduction
• Combining/Modifying Datasets
• PROC SQL
• SAS Macros
• Basic Statistics
• Basic Statistical Procedures
• Data Exploration
• Advanced Statistical Techniques
• Working with Time Series Data
• Data Optimization using SAS

STEP 2

Learn the concepts of Data Science 


R 

The next step to becoming a data scientist is
learning R, an emerging and highly in-demand open
source technology. R is an extremely powerful data
science and analytics language with a steep
learning curve and a very vibrant community. This
is why it is quickly becoming the technology of
choice for organizations that are adopting
analytics for competitive advantage.



Key Learning Objectives 

Simplilearn's Data Scientist with R programming certification has a clear
focus on the vital concepts of business analytics and R programming. By
the end of training, participants will be able to:

• Work on data exploration, data visualization, and predictive modeling techniques with ease.
• Gain fundamental knowledge of analytics and how it assists with decision making.
• Work with confidence using the R language.
• Understand and work on statistical concepts like linear and logistic regression, cluster analysis, and forecasting.
• Develop a structured approach to using statistical techniques and the R language.
• Perform sharp data analysis to make business decisions.
Course Curriculum

• Introduction to Analytics
• Statistical Concepts and Their Application in Business
• Basic Analytic Techniques Using R
• Data Exploration
• Data Visualization
• Diagnostic Analytics
• Linear Regression
• Logistic Regression
• Cluster Analysis
• Time Series Analysis





STEP 3

Harness the power of Big Data 
& Hadoop 


Big Data Hadoop and Spark Developer 

The final step to becoming a Data Scientist is the
ability to work with Big Data and its components.
This training is a deep dive into Hadoop and its
ecosystem components, including MapReduce, HDFS,
YARN, HBase, Impala, Sqoop, and Flume. It also
provides an introduction to Apache Spark, the next
step after Hadoop. After completing this program,
you will not only be ready to enter the Big Data
domain but also be able to clear the in-demand
Cloudera CCA175 certification.



Key Learning Objectives 

• Master the concepts of the Hadoop framework and its deployment in a cluster environment
• Understand how the Hadoop ecosystem fits in with the data processing lifecycle
• Learn to write complex MapReduce programs
• Describe how to ingest data using Sqoop and Flume
• Get introduced to Apache Spark and its components
• List the best practices for data storage
• Explain how to model structured data as tables with Impala and Hive


Course Curriculum 


• Introduction to Big Data and the Hadoop Ecosystem
• HDFS and Hadoop Architecture
• MapReduce and Sqoop
• Basics of Impala and Hive
• Working with Impala and Hive
• Types of Data Formats
• Advanced Hive Concepts and Data File Partitioning
• Apache Flume and HBase
• Apache Pig
• Basics of Apache Spark, RDDs in Spark, and Applications

Data Science with Python 


Python is a general-purpose, multi-paradigm
programming language that has gained wide
popularity in data science because of its simple
syntax and its operability on different
ecosystems. This Python course helps programmers
work with data in whatever way they need: data
munging, data wrangling, website scraping, web
application building, data engineering, and more.
The Python language makes it easy for programmers
to write maintainable, large-scale, and robust
code.



Key Learning Objectives 

• Gain an in-depth understanding of the data science process, data wrangling,
data exploration, data visualization, hypothesis building, and testing.
You will also learn the basics of statistics.
• Understand the essential concepts of Python programming like data
types, tuples, lists, dicts, basic operators, and functions.
• Perform high-level mathematical, scientific, and technical computing
using the NumPy and SciPy packages.
• Perform data analysis and manipulation using the data structures and tools
provided in the Pandas package.
• Gain expertise in machine learning using the Scikit-Learn package.

Course Curriculum 


• Data Science Overview
• Data Analytics Overview
• Statistical Analysis and Business Applications
• Python Environment Setup and Essentials
• Mathematical Computing with Python (NumPy)
• Scientific Computing with Python (SciPy)
• Data Manipulation with Pandas
• Machine Learning with Scikit-Learn
• Natural Language Processing with Scikit-Learn
• Data Visualization in Python using matplotlib
• Web Scraping with BeautifulSoup
• Python Integration with Hadoop MapReduce and Spark

STEP 


Business Analytics with Excel 

This is the basic step for all working
professionals, as Excel is the tool of choice in
most organizations. It is important for an analyst
to be comfortable with Excel and basic analytical
concepts. These techniques help solve many of the
simpler analytic problems and are an important
skill for every analytics professional.



Key Learning Objectives 

• Understand the meaning of business analytics and its importance in the industry
• Grasp the fundamentals of Excel analytics functions and conditional formatting
• Learn how to analyze complex datasets using pivot tables and slicers
• Solve stochastic and deterministic analytical problems using tools like
Scenario Manager, Solver, and Goal Seek
• Apply statistical tools and concepts like moving average, hypothesis
testing, ANOVA, and regression to data sets using Excel
• Represent your findings using charts and dashboards
• Get introduced to the latest Microsoft analytic and visualization tools, such as Power BI

Course Curriculum 


• Introduction to Business Analytics
• Formatting, Conditional Formatting, and Important Functions
• Analyzing Data with Pivot Tables
• Dashboarding
• Business Analytics with Excel
• Data Analysis Using Statistics
• Power BI
STEP




Apache Kafka 

This course introduces you to Apache Kafka, a
high-performance real-time messaging system that
can process millions of messages per second. It
provides a distributed and partitioned messaging
system and is highly fault tolerant.



Key Learning Objectives 

• Describe the importance of big data
• Describe the fundamental concepts of Kafka
• Describe the architecture of Kafka
• Explain how to install and configure Kafka
• Explain how to use Kafka for real-time messaging


Course Curriculum 

• Course Introduction
• Big Data Overview
• Introduction to Zookeeper
• Introduction to Kafka
• Installation and Configuration
• Kafka Interfaces




Other Electives 


Certified SAS Base Programmer 

The SAS Base Programmer course is a
beginner-level course for SAS professionals. This
training has been designed to enable you to start
your analytics career with SAS and prepare for the
SAS Base Programmer certification. The course
explores the SAS tool and different techniques to
help you access and manage data, create data
structures, generate reports, and handle errors.
These techniques are essential for a professional
starting work on a SAS assignment, and they form a
strong base for advanced techniques and
certifications.



Introduction to Big Data and Hadoop 

This is an ideal course for individuals who want to
understand the basic concepts of Big Data and
Hadoop. On completing this course, learners will
be able to explain what goes on behind the
processing of huge volumes of data as the industry
switches over from Excel-based analytics to
real-time analytics.



Python Basics 

This course is ideal for understanding the
basics of the Python programming language.
Advisory Board Members



Named by Onalytica as one of the 3 most 
influential people in Big Data, Ronald is an author 
for a number of leading Big Data & Data Science 
websites, including Datafloq, Data Science 
Central, and The Guardian. He is also a renowned 
speaker at industry events. 

Ronald van Loon

Big Data Expert, Director at Adversitement



Sina has over 10 years of experience in 
Technology as a Big Data Architect at Bell Labs 
and as a Platinum level trainer. He is very 
passionate about building a Big Data education 
ecosystem and has been a contributor to a 
number of magazine and journal publications. 

Sina Jamshidi 

Big Data Lead at Bell Labs 



Simon is a Data Scientist with 12 years of 
experience in Healthcare analytics. He has a 
master's degree in Biostatistics from the 
University of Western Ontario. He is passionate 
about teaching Data Science, and has a number 
of journal publications in preventive medicine
and data analytics. 

Simon Tavasoli 

Analytics Lead at Cancer Care Ontario 
Paul is a lead SAS Data Scientist at the Bank of
Montreal. As a SAS Certified Predictive Modeler,
SAS Statistical Business Analyst, and SAS 
Certified Advanced Programmer, Paul is 
passionate about sharing his knowledge on how 
Data Science can support data-driven business 
decisions. 

Paul Sharkov 

Data Scientist at BMO Financial Group, Member 
of SAS Canada Community 


Alvaro is a Data Scientist who founded Quant 
Company. He has also worked as a lead Economic 
Analyst in the Central Bank of Guatemala. He has 
a master's degree in Quantitative Economics and 
Applied Mathematics and is actively involved in 
consulting and training in the Data Science space. 

Alvaro Fuentes 

Founder and Data Scientist at Quant Company 